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Abstract 

Functional transposable elements (TEs) of several Pseudomonas spp. strains isolated from black shale ore of Lubin mine and 
from post-flotation tailings of Zelazny Most in Poland, were identified using a positive selection trap plasmid strategy. This 
approach led to the capture and characterization of (i) 13 insertion sequences from 5 IS families (IS3, IS5, \SL3, \S30 and 
\S1380), (ii) isoforms of two Tn3-family transposons - Tn5563a and Tn4662a (the latter contains a toxin-antitoxin system), as 
well as (ill) non-autonomous TEs of diverse structure, ranging in size from 262 to 3892 bp. The non-autonomous elements 
transposed into AT-rich DNA regions and generated 5- or 6-bp sequence duplications at the target site of transposition. 
Although these TEs lack a transposase gene, they contain homologous 38-bp-long terminal inverted repeat sequences (IRs), 
highly conserved in Tn5563a and many other Tni-family transposons. The simplest elements of this type, designated TIMEs 
(Tn3 family-derived Inverted-repeat Miniature Elements) (262 bp), were identified within two natural plasmids (pZMlPl and 
pLM8P2) of Pseudomonas spp. It was demonstrated that TIMEs are able to mobilize segments of plasmid DNA for 
transposition, which results in the generation of more complex non-autonomous elements, resembling IS-driven composite 
transposons in structure. Such transposon-like elements may contain different functional genetic modules in their core 
regions, including plasmid replication systems. Another non-autonomous element "captured" with a trap plasmid was a 
TIME derivative containing a predicted resolvase gene and a res site typical for many Tn3-family transposons. The 
identification of a portable site-specific recombination system is another intriguing example confirming the important role 
of non-autonomous TEs of the TIME family in shuffling genetic information in bacterial genomes. Transposition of such 
mosaic elements may have a significant impact on diversity and evolution, not only of transposons and plasmids, but also of 
other types of mobile genetic elements. 
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Introduction 

Transposable elements (TEs) play a very important role in 
bacterial evolution. They can shape the structure of bacterial 
genomes and influence the expression of many genes by the 
generation of various DNA rearrangements. The presence and 
activity of TEs may lead not only to insertional inactivation of 
random genes, but also to deletions, duplications and transloca- 
tions of DNA segments as well as replicon fusions. Moreover, some 
elements contain promoters that can drive transcription of 
downstream genes, which may result in the activation and 
expression of various previously silent phenotypes [1]. 

TEs are highly diverse with respect to their structure, specific 
properties and the regulation of the transposition process. Due to 
their genetic organization they may be divided into two transposon 
classes. Class I groups insertion sequences (ISs) - the simplest TEs 
- and elements whose transposition is dependent on ISs, i.e. (i) 



composite transposons containing two flanking IS copies, and (ii) 
transposable modules (TMos) [2,3] or ISCR (Insertion Sequence 
Common Region) [4], containing only one terminally-placed IS 
copy. Class II transposons, gathering mainly elements of the Tn3 
and Tn7 families, encode a more complex transposition machin- 
ery. 

The characteristic feature of all these elements is the presence of 
a gene encoding transposase: an enzyme that catalyzes the process 
of transposition. Analyses performed by Aziz and colleagues 
revealed that transposase genes are the most prevalent genes in 
nature, which illustrates the huge scale and rate of transposition 
events leading to the acceleration of biological diversification [5] . 
Transposases specifically recognize and interact with the terminal 
parts of TEs, which in most cases carry inverted repeat sequences 
(IRs). Transposase accessibility is therefore the main factor that 
determines the frequency of transposition. Many of the identified 
regulatory mechanisms that control the process of transposition act 
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at the level of transcription or translation of the proteins involved 
[6]. Many transposases preferentially act in cis, since they more 
frequently bind DNA at their site of synthesis. Therefore, the 
frequency of transposition is inversely proportional to the distance 
between the transposase gene and its cognate binding site. 
Nevertheless, activation of transposition in Imns is not a rare 
phenomenon (as shown e.g. in the case of several ISs as well as 
Tn5- and Tn3 -family elements) and may favor the movement of 
not only intact elements, but also of mobile insertion cassettes 
(MICs) [7] and non-autonomous miniature inverted-repeat TEs 
(MITEs). 

MITEs are defined as small (approx. 30-300 bp) multicopy 
elements that are present in up to several hundred copies per 
genome. They contain terminal inverted repeats, often similar in 
sequence to IRs of intact ISs [8]. However, their core regions lack 
a transposase gene. In many cases these elements are bordered by 
short direct repeats (DRs) (usually the dinucleotide TA), so in this 
respect they resemble functional TEs whose transposition leads to 
duplication of the insertion target sequence [9] . The aforemen- 
tioned features strongly suggest that MITEs originate from 
defective ISs and their transposition may be trans-activated by 
compatible transposases [10]. However, mobility of these elements 
has been demonstrated in only a very few cases (e.g. [1 1]). 

The first prokaryotic MITEs were discovered in Neisseria spp. 
(CREE - Correia Repeat Enclosed Element) [12,13]. Bioinfor- 
matic analyses have since led to the identification of diverse types 
of MITE occurring in different taxonomic groups of bacteria [8] . 
Such elements are found in intergenic and intragenic genomic 
locations. They are often co-transcribed with the upstream genes, 
which may influence the conformation and stability of the 
resulting transcripts [14]. Moreover, some MITEs contain, in 
their core regions, functional promoters and diverse sequence 
motifs recognized by host proteins (e.g. integration-host-factor 
binding site), which may additionally modulate the expression of 
nearby genes [8]. These elements may therefore play important 
regulatory roles. Furthermore, the presence of many identical 
MITEs in a cell can lead to genetic rearrangements resulting from 
homologous recombination, which may influence the genome 
architecture. 

In this study, we identified and characterized functional 
autonomous as well as non-autonomous transposable elements 
originating from transposons of the Tn5 family. We have 
demonstrated their transposition activity and ability to mobilize 
genomic DNA for transposition, which leads to the generation of 
diverse non-autonomous elements of more complex structure. The 
obtained results shed new light on the diversity and properties of 
TEs of the Tn5 family, which are the most ubiquitous transposons 
in bacteria. 

Materials and Methods 

Bacterial strains, plasmids and culture conditions 

The bacterial strains analyzed in this study were isolated from 
black shale ore coUected in Lubin copper mine (LM) (51°24'N- 
16°12'E) and postflotation tailings sampled from Zelazny Most 
(ZM) (51°30'N-16°12'E) in Poland, during the Bioshale FP6 
European project [1.^]- The samples used for strain isolation were 
provided by the Copper Research and Design Centre Ltd. 
Research and Development Centre, being a part of KGHM 
POLSKA MIEDZ SA capital group (owner of the Lubin mine). 
The field studies did not involve endangered or protected species. 
The strains (listed in Table SI) were classified by 16S rDNA 
sequence similarity. Rifampicin resistant (Rif ) derivatives of the 
wild-type strains of Pseudomonas spp. were obtained and used as 



host strains for trap plasmid pMATl. Escherichia coli TGI [16] 
was used in plasmid construction. Bacteria were grown in Luria- 
Bertani (LB) medium [16] at 37°C (E. coli) or 30°C (other strains). 
When necessar)', the medium was supplemented with sucrose 
(10%) and with antibiotics: ampicillin (100 |Xg/ml), kanamycin 
(300 Hg/ml - Pseudomonas aeruginosa LMIO; 50 Itg/ml - other 
strains), rifampicin (50 |ig/ml). The plasmids used in this study 
have been described previously: (i) pMATl (Km', ori pBBRl, oriT 
RK2, sacB) [3] and (ii) pCM132 (Km"", ori pMBI, on RK2, oriT 
RK2, lacZ) [17], employed for the identification of TEs and |3- 
galactosidase assays, respectively. 

DNA manipulations 

Common DNA manipulation methods were performed as 
described by Sambrook and Russell [16]. Plasmid DNA was 
isolated using a standard alkaline lysis procedur(; [18] and, when 
required, purified by CsCl-ethidium bromide density gradient 
centrifugation. Total genomic DNA of Pseudomonas spp. was 
isolated by a standard procedure [19]. 

Identification of a pool of transposable elements (TEs) 

The E. coli-Pseudomonas spp. shuttie vector pMATl was used 
for the identification of TEs [3]. This trap plasmid contains the 
sacB gene of Bacillus suhtilis, coding for levan sucrase, an enzyme 
that catalyzes sucrose hydrolysis and levan extension. The 
expression of sacB is lethal for Gram-negative bacteria in the 
presence of sucrose [20] . Therefore, cells carrying the functional 
sacB gene are sucrose sensitive (Suc^) and their cultivation in 
medium containing sucrose results in cell lysis. This aUows direct 
selection ofsacB mutants (Sue") (e.g. carrying inserted TEs), whose 
growth is not affected under these conditions, thus enabling 
positive selection of transposition mutants. 

Introduction of plasmid DNA into bacterial cells 

DNA was introduced into Pseudomonas spp. by triparental 
mating as described previously [21], or into E. coli by chemical 
transformation according to the method of Kushner [22]. 

PCR amplification 

Relevant DNA regions were amplified by PCR using appro- 
priate template DNAs, specific oligonucleotide primers (listed in 
Table S2), dNTPs and DreamTaq Green PCR Master Mix 
(Thermo Scientific) or Taq DNA polymerase (Qiagen, with 
supplied buffer) in a Mastercycler (Eppendorf). Products of PCR 
amplification were separated by electrophoresis on 0.8 or 2% 
agarose gels and purified with a QIAquick PCR Purification Kit 
(Qiagen). 

DNA-DNA hybridization 

The copy number and genomic localization of the identified 
TEs as well as the distribution of homologous sequences in the 
Pseudomonas spp. genomes were analyzed by DNA-DNA 
hybridization. Molecular probes specific for TIME elements were 
prepared by (i) PCR amplification of selected DNA fragments 
(specific oligonucleotide primer pairs are listed in Table S2) and (ii) 
labelling of gel-purified amplified DNA fragments with digox- 
igenin (DIG, Roche). Total genomic DNA or plasmid DNA of the 
analyzed strains were digested using appropriate restriction 
enzymes (to avoid multiple hybridization signals from a single 
copy of a given TE) and, after DNA electrophoresis, transferred 
onto nylon membrane (Roche). Hybridization and visualization of 
bound DIG-labeUed probes were carried out as recommended by 
the supplier. The number of DNA bands hybridizing with each 
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probe was equivalent to the minimum number of copies of a given 
element within the genome. 

Assay for p-galactosidase activity 

DNA fragments containing a TIMEl element or a 3 '-end 
fragment of TIME-RES were amplified by PGR with appropriate 
primer pairs (listed in Table S2) and cloned into the promoter- 
probe vector pCM132 [17] upstream of the promoter-less lacZ 
reporter gene. Promoter activit)' was confirmed by assays for (3- 
galactosidase activity in P. aeruginosa PAO 1161 as described by 
Thibodeau and colleagues [23]. The growth of bacterial cultures 
and kinetic P-galactosidase assays were performed using 96-weU 
plates. Individual assays were repeated three times. 

Sequence analyses and annotation 

The nucleotide sequences of identified transposable elements 
were determined using a dye terminator sequencing kit and an 
automatic sequencer (ABI 377 Perkin Ekner). Similarity searches 
were performed using the ISfinder (http://www-is.biotoul.fr/is. 
html) [24] and BLAST programs [25] provided by the National 
Center for Biotechnology Information (http://www.ncbi. 
nlm.nih.gov/BLAST/). 

Nucleotide sequence accession numbers 

Nucleotide sequences obtained in this study have been 
deposited in the GenBank database. Nucleotide sequences of the 
identified insertion sequences have been additionally deposited in 
the ISfinder database. GenBank accession numbers of the 16S 
rDNA sequences of Pseudomonas spp. strains are given in Table 
SI. Accession numbers of the nucleotide sequences of TEs are 
gi\'cn in Table 1. The accession numbers of the plasmid pZMlPl 
and pLM8P2 nucleotide sequences are KJ940992 and KJ940994, 
respectively. 

Results 

Isolation and identification of Pseudomonas spp. strains 

The bacterial strains analyzed in this study were isolated from 
black shale ore from the active Lubin copper mine and from 
postflotation tailings from Zelazny Most reservoir (Lower Silesia 
province, Poland) [26]. The copper deposit in this region is 
polymetallic and contains a particularly high concentration of 
valuable metals [27]. The chemical composition of the black shale 
ore makes it a unique and extreme environment for living 
organisms because of (i) the low amount of easily degradable 
carbon sources, (ii) high concentration of heavy metals and 
arsenic, (iii) alkaline pH and (iv) high salinity [28,29]. 

Twenty-six bacterial strains were isolated, half of which were 
classified as members of the genus Pseudomonas {Gammaproteo- 
bacteria) by 168 rDNA sequence similarity. Two of the strains 
were identified at the species level (Table SI). The Pseudomonas 
spp. strains, designated LM5-LM8, LM10-LM15, LM25 (LM - 
Lubin Mine) and ZMl, ZM2 (ZM - Zelazny Most), were 
subjected to more detailed analysis aimed at characterizing their 
TEs. 

Identification of a pool of TEs 

To identify functional TEs within 1 3 Pseudomonas spp. strains 
we employed the trap plasmid pMAT 1 , containing the sacB gene 
of 5. subtilis as a positive-selection cassette [3]. After introducing 
the plasmid by triparental mating into rifampicin resistant (Rif^ 
derivatives of the aforementioned strains (denoted LM5R, LM6R, 
etc.), transposition events, which occurred within the sacB cassette, 
were selected by the application of LB agar medium supplemented 



with sucrose (selection of Suc*^ clones; see Materials and Methods 
for details). The plasmids carried by one hundred Suc*^ colonies of 
each strain were analyzed by DNA electrophoresis. This revealed 
that approx. 45 '/o of the examined clones carried pMATl- 
derivatives containing inserts of different size: (i) <1 kb, (ii) 1-2 kb 
and (iii) >2 kb. The location of the TEs within the sacB gene was 
confirmed by PGR using a set of previously described cartridge- 
specific primers [3]. Further analysis (DNA restriction mapping, 
DNA sequencing and DNA hybridization) confirmed the "cap- 
ture" of functional transposable elements, whose characteristics 
are presented in Table 1 . 

Genetic structure of the TEs 

Detailed analysis of the nucleotide sequences of the "captured" 
elements characterized their genetic structure by identifying (in 
most clones) (i) the transposase gene, (ii) terminal inverted repeats 
(IR), being the sites for transposase binding and action, and (iii) the 
target sequence (DR), which in most cases was duplicated upon 
insertion. The elements were classified into appropriate IS/Tn 
families based on the results of comparative analysis. This 
confirmed the identification of 13 insertion sequences (ISs) and 
several elements of the TrL? transposon family, including a novel 
type of non-autonomous element of mosaic structure (Fig. 1). 

Characterization of the ISs 

As shown in Table 1, the "captured" ISs include representatives 
of five IS families. The most numerous are members of the IS5 
and IS50 families. Among 13 identified ISs, six are novel elements, 
which have been given the following designation: (i-ii) lSPsp6 
(host strain ZMl) and ISPsplO (LM25) - IS5 famUy (IS5 and 
IS427 groups, respectively), (iii-v) lSPsp7 (ZM2), ISP.smel (LM7) 
and ISPspS (LM13) - ISiO family, and (vi) ISP5^9 (LM13) - 
IS1380 family. AU of the other identified ISs are isoforms of 
previously described elements (Table 1) (according to the ISfinder 
database isoforms show more than 98% amino acid sequence 
similarity between transposases and/or more than 95% of identity 
between entire IS nucleotide sequences). 

AU but one IS shows the greatest similarity to elements 
previously identified in various Pseudomonas spp. strains (data 
not shown). The one exception is lSPsp9 {1S1380 family) whose 
transposase is most similar [approx. 27% amino acid (aa) sequence 
identity] to transposases of two Gram-positive bacteria: Strepto- 
coccus suis 05ZYH33 (\S>Ssu5) and Bacillus halodurans C-125 
{ISBhal) (accession nos. NC_009442 and AB024553, respective- 
ly)- 

Most of the "captured" elements contain only one open reading 
frame (ORF), encoding a transposase. ISP«M2a and ISPsplO 
(Table 1) have two ORFs encoding predicted trans-frame fusion 
transposases, which is common among members of the IS5 (group 
IS5i) and IS5 (group IS427) families [9]. A unique feature of 
ISPsplO is the lack of a stop codon within ORF2 encoding the 
terminal part of the transposase. ISPsplO transposed into the sacB 
gene just upstream of a +TAA sequence (upon insertion the 
dinucleotide TA was duphcated); therefore, the stop codon in this 
case is provided by the target site of transposition. It is likely that 
ISPsplO preferentially recognizes such target sites in order to 
remain active, although due to the very low transposition 
frequency of this element, we were unable to analyze other 
transposition events. 

Of particular interest is lSPpul2a (1SL3 family), which 
resembles non-composite transposons in structure. Besides tnpA 
(transposase) and tnpR (transcriptional regulator) genes, this 
element contains two additional ORFs not involved in the process 
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Figure 1. Non-autonomous and autonomous transposable elements of the Tn5 transposon family identified in Pseudomonas spp. 
strains. A. The genetic organization of TEs identified with trap plasmid pMATI in strains ZM1 (TIME, TIME-C0MP1, TIIV1E-C0IV1P2), LM8 (TIME, TIME- 
RES), LM10 and LM13 (Tn4662a), as well as LM7 (Tn5563a). Inverted repeats (IRs) flanking the elements are marked by red triangles. Predicted coding 
regions are represented by thick arrows indicating the direction of transcription. Thin black arrows above TIMEs indicate the orientation of the 
elements. Shaded areas connect homologous DNA regions, res indicates the cointegrate resolution site of the site-specific recombination systems. 
The predicted toxin-antitoxin system is boxed and denoted by TA. Note that TIMEs are not drawn to scale. B. Alignment of the terminal inverted 
repeat nucleotide sequences (IRL - left IR; IRR - right IR) of TIMEs and two autonomous Tn3-family transposons, Tn5563a and Jn5044. Identical 
residues are indicated by gray shading. C. Nucleotide sequences of direct repeats (DRs) generated by TIME elements during transposition into the 
selective cartridge of pMATI. 
doi:1 0.1 371 /journal.pone.01 0501 0.gOOl 



of transposition, encoding a putative lipoprotein signal peptidase 
and divalent heavy metal antiporter [30] . 

Characterization of the Tn3-family elements 

Autonomous elements. In the tested strains we identified 
two functional non-composite transposons of the Tn3 family. 
These are isoforms of previously described elements: (i) Tn4662- 
distinguished in silico from the nucleotide sequence of Pseudomo- 
nas putida HSl plasmid pDKl [31] and (ii) Tn5563 identified 
from the sequence of plasmid pRA2 of Pseudomonas alcaligenes 
NCIB 9867 [32] (Table 1). 

Transposons Tn4662a (captured in LMIO and LM13 strains) 
and Tn5563a (LM7) are not highly related, since they represent 
two subfamilies of the Tn5 family (Tn507 and Tn5044, 
respectively). These elements differ in overall genetic structure 
and their transposases show only 2 1 % aa sequence similarity. As is 
typical for Tn3 -family transposons, both elements contain a site- 
specific recombination module (RES) encoding a resolvase (TnpR) 
responsible for the resolution of cointegrates (at the res sites), which 
are intermediate forms in replicative transposition. 



Tn5563?L has identical 39-bp-long IRs, whose sequence shows 
significant similarity to IRs of Tn3, Tn501 and Tn5044, the 
archetypes of three distinguished subfamilies of the Tn5 transpo- 
son family. In contrast, the IRL and IRR of Tn4622a (41 bp) are 
not identical (nucleotide differences at 5 positions) and then- 
similarity to IRs of other Tn3 family members is limited to short 
sequence stretches (data not shown). Both elements transposed into 
A-l-T-rich sequence regions of the sacB gene, and generated 5-bp 
DRs: 5'-AAATT-3' (LMIO), 5'-AAATA-3' (LM13) and 5'- 
AATAT-3' (LM7). 

As shown in Fig. lA, Tn4662a. contains eight ORFs. Besides 
the transposase and resolvase genes (ORFl and ORF2) it also 
encodes three putative proteins of unknown function (ORF6- 
ORF8) as well as components of a toxin-antitoxin system (ORF3 
and ORF4). The putative orf3-orf4 operon is highly homologous 
to the tad-ata addiction system identified in plasmid pAMI2 of 
Paracoccus aminophilus ]CM 7686 [33]. Orf3 protein of Tn4662a 
shares 73% aa identity with the Tad toxin (YP_00 1965064), while 
Orf4 is 70% identical to antitoxin Ata, a member of the XRE/ 
CRO family (YP_00 1965065). Homologous loci are also present 
within transposon Tn4662 as well as closely related Tn5301- 
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Tn3502 [34] and Tn5503 [35], but they were not distinguished at 
the time of sequence deposition. Such portable genetic modules 
(when functional) may potentially provide stabilization to the 
transposons and their carrier replicons by eliminating TA-less 
clones from the bacterial population. 

The second transposon identified in this study, Tn5565a 
(Fig. lA), is a member of a Tn3 subfamily whose archetype is 
T^a5044 of Xanthommas campestris (accession no. Y17691) [35]. 
These transposons contain highly homologous recombination 
systems that mediate the process of transposition and resolution of 
co-integrates, as well as mercurv' resistance operons (m«r). Tii5044 
carries a complete set of 7 mer genes (also conserved in other 
transposons, e.g. Til27 and Tn507), while Tr\5563a. contains only 
part of this operon: genes merP and merT encoding mercury 
transporters, and merR encoding a transcription regulator (ORF5- 
ORF7; Fig. lA). 

Non-autonomous elements. Detailed analysis revealed that 
some of the pMATl derivatives isolated from Sue' clones of the 
ZMl and LM8 strains contained short DNA inserts not exceeding 
300 bp in size within the sacB gene. The nucleotide sequence of 
one of these inserts was obtained. An internal part of the insert was 
then amplified by PGR, labeled with digoxigenin (DIG) and used 
as a molecular probe in hybridization analysis of all pMATl 
derivatives obtained in the ZMl and LM8 strains. This analysis 
revealed that small homologous DNA inserts were present in 3 
ZMl plasmids and in 18 LM8 plasmids. Interestingly, plasmids 
from both strains containing larger insertions were also detected. 
These inserts fall within 3 size ranges: (i) 1-2 kb (1 plasmid in 
LM8), (ii) 2-3 kb (78 plasmids in ZMl) and (iii) 3-4 kb (1 plasmid 
in ZMl). PGR analysis using four cassette-specific primers [3] 
confirmed that all the insertions occurred within the sacB gene of 
pMATl. Further analysis showed that the identified DNA 
segments represent two types of non-autonomous TEs of the 
Tni family (Fig. lA). 

The nucleotide sequences of 9 the smallest "captured" elements 
were determined (2 from ZMl and 7 from LM8). AH of these 
elements were of the same size (262 bp) and they showed a very 
high level of sequence identity. The elements identified in ZMl 
were identical, and those of LM8 differed only in 2 nucleotides: a 
transversion of guanine to thymine at position 34 (IRL) and a 
transition of cytosine to thymine at position 152 (Fig. 2 A, asterisks 
mark variable positions). Moreover, among the elements "cap- 
tured" in strain LM8, a single isoform was found with an 
additional nucleotide change (a guanine to adenine transition at 
position 1 74) (Fig. 2A). 

The core region of these elements shows no sequence 
similarities to any other transposable element described so far. 
They contain no ORFs, and therefore they do not encode a 
transposase. At both ends they contain 38-bp terminal IRs sharing 
a significant level of sequence identity with IRs of transposons of 
the T'n5044 subgroup of the Tn3 transposon family. The highest 
identity was found with terminal repeats of transposons Tn3363n 
(identified in this study) and Tn5044 (Fig. IB). 

AU of these elements are bordered by short (5- or 6-bp long) 
duplicated AT-rich target sequences (DRs) (Fig. IG), confirming 
their transposition into the selective cartridge of pMATl. 
Interestingly, all the 6-bp DRs in the elements from both strains 
are identical (5 ' -TTTATT-3 ') and resulted from transposition into 
the same site within the sacB gene. This suggests that the 
generation of DRs of this size may be dependent on a specific 
target sequence (Fig. IC). 

Since the identified elements do not encode a transposase, their 
transposition has to be dependent on trans-encoded transposase. 
The presence of IRs typical for some members of the Tn3 family 



strongly suggests that these elements are mobilized for transposi- 
tion by trans-encoded Tn3-family transposases. They are there- 
fore non-autonomous elements, which may be included in the 
group of miniature inverted-repeat transposable elements 
(MITEs). Based on the IR sequence homology, we propose that 
the elements identified in this study are designated as TIMEs 
(Tn3-derived Inverted-repeat Miniature Elements). The three 
identified isoforms have been named TIMEl (ZMl strain), 
TIME2 and TIME3 (LM8 strain) (Fig. 2A). 

The larger "captured" elements were transposon-like non- 
autonomous TEs. These elements appear to have originated from 
one (TIME-RES, i.e. resolutive TIME) or two TIMEs (TIME- 
GOMP, i.e. composite TIME) (Table 1) (Fig. lA). 

TIME-RES (ZMl strain; 1186 bp; G+C content 61.6%) 
transposed into the sacB gene of pMATl at a frequency of 
7.8x10 Its insertion resulted in the duplication of the 5-bp 
target sequence 5'-AAGAT-3'. TIME-RES has imperfect termi- 
nal IRs of 40 bp (with 8 mismatches), with high sequence 
simUarity (95%) to tiie IRs of TIMEs (Fig. IB). The sequence 
conservation of TIME-RES and TIMEs is also observed within a 
60 bp-long DNA region adjacent to the IRR (Fig. 2) (94% 
sequence identity). 

The major part of the core region of TIME-RES is composed of 
an ORE encoding a putative resolvase TnpR (309 aa) as well as an 
AT-rich putative recombination site [res). This DNA region 
displays 99% nucleotide sequence identity to the co-integrate 
resolution system of the mercury-resistance transposon Tn5563 
[32] (as well as to the Tn3363d isoform idc-ntified in Pseudomonas 
sp. LM7 in this study). The resolvases encoded by TIME-RES and 
Tn3563a are identical, and they contain a DNA binding helix- 
turn-helix (HTH) motif in their C-terminal parts that is highly 
conserved in this group of recombinases (data not shown). These 
observations indicate that TIME-RES is a hybrid element. It lacks 
a significant part of the core region of TIME, but has gained a site- 
specific recombination module. Therefore, TIME-RES may serve 
as a portable RES system. 

TIME-COMPl and TIME-COMP2 "captured" in tiie ZMl 
strain, differ in structure from TIME-RES. These elements have 
been generated from two TIMEs and resemble typical IS-driven 
composite transposons. Thus, TIMEs are present at both ends of 
these TIME-GOMP elements, placed either in the same (TIME- 
GOMP2) or in the opposite orientation (TIME-COMPl) 
(Fig. lA). The core regions of TIME-GOMP elements are 
different, and they do not contain any genes involved in the 
process of transposition. 

Transposition of TIME-GOMP 1 (2469 bp) and TIME- 
GOMP2 (3892 bp) into the sacB gene resulted in the generation 
of 5-bp-long AT-rich DRs (Fig. IC). As shown in Table 1, the 
transposition frequencies of these elements varied significantiy. 
TIME-GOMP2 showed ver\' low transposition activity, since only 
a single pMATl derivative containing this element was detected 
among the tested pool of Suc"^ clones. 

The TIMEs flanking TIME-GOMPl are not identical. The 5'- 
end TIME is 100% identical to TIMEl, while tiie 3'-end TIME 
(TIME4) differs in 1 nucleotide (cytosine to thymine transition at 
position 152) (Fig. 2A). In comparison, TIME-GOMP2 contains 
identical copies of TIME4 at both ends. 

The core region of TIME-GOMPl (1945 bp) contains two 
ORFs, coding for a putative resolvase TnpR (187 aa) (ORFl) and 
a putative protein of unknown function (167 aa), representing the 
DUF1652 family (ORF2). BLASTP analysis revealed that genes 
encoding DUF1652 proteins are conser\'ed exclusively in the 
chromosomes of several Pseudomonas species, but they are not 
structurally linked with any TEs [highest similarity (99%) observed 
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TIME-RES 
TIMEl 
TIME2 
T1ME3 
TIME4 
HM 626202 
CP000949 
DQ352175 
AM157767x2 
AM039952 
AJ344068 
PSU80214 
X98999 
AB083212_A 
AB120321 
AB083212_B 
CP005976_A 
CP005976_B 
AF 130439 
FJ859895 
AY831462 
AF020724 
AE016855 

TIME-RES 
TIMEl 
TIME2 
TIMES 
TIME4 
HMe2e2 02 
CP000949 
DQ352175 
AM157767x2 
AM039952 
AJ3440e8 
PSU80214 
X98999 
AB0 83212_A 
AB120321 
AB0 83212_B 
CP0 05976_A 
CP0 05976_B 
AF1304 39 
FJ8598 95 
AY8314 62 
AF0207 24 
AE0168 55 

TIME-RES 
TIMEl 
TIME2 
TIME3 
TIME4 
HMe262 02 
CP0009 49 
DQ35217S 
AM157767x2 
AM0399 52 
AJ3440 68 
PSU802 14 
X989 99 
AB0 83212_A 
AB1203 21 
AB0 83212_B 
CP0 05976_A 
CP0 05976_B 
AF1304 39 
FJ8598 95 
AY8314 62 
AF0207 24 
AE0168 55 



I \ \ 

GGGGTTTGGGGAGCAATGGAACCAAAAA . 
GGGGTTTGGGGAGCAGTGGAACCAAAAA . 
GQGGTTTGGGGAGCAGTGGASCCAAAAA . 
GQGGTTTGGGGSGCAGTGGASCCAAAAA . 
GGGGTTTGGGGAGCAGTGGAACCAAAAA . 
GGSGTTTGGGGSGCAGTGGASCCAAAAA . 
GGGGTTTGGGGSGCAGTGGASCCAAAAA . 
GQGGTTTGGGGAGCAGTGGAaCCAAAAA . 
GGGGTTTGGGGASCAGTGGAACCAAAAA . 
GGGGTTTGGGGAGCAGTGGAACCAAAAA . 
GGGGTTTGGGGAGCAGTGGAACCAAAAA . 
GQGGTTTGGGGAGCAGTGGAaCCAAAAA . 
GQGGGTTGGGGAGCAATGGASCTGAAAA . 
GGGGGTTGGGGAGCAATGGAACTGAAAA . 
GQGGGTTGGGGAGCAATGGAACTGAAAA . 
GQGGGTTGGGGAGCAATGGAJCTGAAAA . 
GQGGGTTGGGGAGCAATGGAACTGAAAA . 
GQGGGTTGGGGAGCAATGGAACTGAAAA . 
GQGGTTTGGGGAGCAATGGAACCGAAAA . 
GQGGTTTGGGGAGCAATGGAACCGAAAA . 
GQGGTTTGGGGAGCAATGGAACCGAAAA . 
GQGGTTTGGGGAGCAATGGAACCGAAAA . 
GQGGT . . AGGGAGCAATGGAACTGAAlUiAAACCAACGTAAGGCATGGTTAGAGTTTGCTAATCCCCTCAGAGTGCQGGGATTAGCGAACT 



. CCAACGTAAGCC ' : ; 40 

. CCAACGTAAGGCTTAGCTAGAGTTCAOCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACTTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACTTAAGGCTTAGCTAGAGTTCAOCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGGCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGGCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGGCGGTTATCCAGACT 87 

. CCAACGTAAGGCTTAGCTAGAGTTCACCGCTGCCGAAAATAGGGQCGGTTATCCAGACT 87 

. CCAACGTAAGCC . GAATTAGAGTTCGTrTTACTCGTCAGCGACTQCGAATTAACGAACT 86 

. CCAACGTAAGCC . GAATTAGAGTTCGTTTTACTCGTCAGCGACTGCGAATTAACGAACT 86 

. CCAACGTAAGCC . GAATTAGAGTTCGTTTTACTCGTCAGCGACTGCGAATTAACGAACT 86 

. CCAACGTAAGCC . GAATTAGAGTTCGTTTTACTCGTCAGCGACTGCGAATTAACGAACT 86 

. CCAACGTAAGCC . QAATTAGAGTTCGTTTTACTCGTCAGCGACTQCGAATTAACGAACT 86 

. CCAACGTAAGCC . QAATTAGAGTTCGTTTTACTCGTCAGCGACTQCQAATTAACGAACT 86 

. CCAACATAAGCACGAATTAAAGTTCGOTTTACTCGTCATCTGTTGTGGATTAGCGAACT 87 

. CCAACGTAAGCACGAATTAGAGTTCGCTTTACTCGTATGTGACTGrTGATTGGaGAACT 87 

. CCAACGTAAGCACQAATTAGAGTTCGOTTTACTCGTATGTGACTGrTGATTGGaGAACT 87 

. CCAACGTAAGCACGAATTAGAGTTCGCTTTACTCGTATGTGACTGTTGATTGGAGAACT 87 



CLASS 



] 



3 
- 4 



GTTCGGCAAAACCTTGG 1099 
75 
75 ■ 
75 



T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CCGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATA G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGTTT TACTG AACGA CTTTT CTGC ACTGC TTTTC ATGCG TTTGG ACGGC CTGGC GT . . C AGTGC CTGCC AGGGT TCGAC AAAAG GATG G 
T TGATT CGTGAGAAAG GTTTATTGC ACTGATTTTC GGGTG TTGG . . TGAC CTTTC CTCGC GATAG CGGAA ACGGT TTCAT AAAAC GGTC G 
T TGATT CGTGAGAAAG GTTTATTGC ACTGATTTTC GGGTG TTGG . . TGAC CTTTC CTCGC GATAG CGGAA ACGGT TTCAT AAAAC GGTC G 
T TGATT CGTGAGAAAG GTTTATTGC ACTGATTTTC GGGTG TTGG . . TGAC CTTTC CTCGC GATAG CGGAA ACGGT TTCAT AAAAC GGTC G 
T TGATT CGTGAGAAAG GTTTATTGC ACTGATTTTC GGGTG TTGG . . TGAC CTTTC CTCGC GATAG CGGAA ACGGT TTCAT AAAAC GGTC G 
T TGATT CGTGAGAAAG GTTTATTGC ACTGATTTTC GGGTG TTGG . . TGAC CTTTC CTCGC GATAG CGGAA ACGGT TTCAT AAAAC GGTGG 
T TGATT CGTGAGAAAG GTTTATTGC ACTGATTTTC GGGTG TTGG . . TGAC CTTTC CTCGC GATAG CGGAAACGGT TTCAT AAAAC GGTG G 
T TGATT GGCGA AACGA GTTTATCGC ACCAC TTTTC GGGTG GTC . . ACGGC CTTTC CTGAT CtTCG CTGAA ACGGT TTCAT AAAAC GGTG G 
C TGATT GTCGA AACGA GTTTATCGA ACCGC TTTTC GGATG TTT . . ACGGT CCTTC CTGAT GTTCG TTAAAGCGGT TTCAT AAAAC GGTG G 
C TGATT GTCGA AACGA GTTTATCGA ACCGC TTTTC GGATG TTT . . ACGGT CCTTC CTGAT GTTCG TTAAAGCGGT TTCAT AAAAC GGTG G 
C TGATT GTCGA AACGA GTTTATCGA ACCGC TTTTC GGATG TTT . . ACGGT CCTTC CTGAT GTTCG TTAAAGCGGT TTCAT AAAAC GGTG G 
T TGATT TTCTG AACGA GTTTATGGC ACCGC TTTTC AQAGT GAT . . ATCAC CTACC GTGGC GlTCG CTGAGGCGGT ATCAT AAAAC G . TG G 



T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGG . C AGG|g CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC . G CTGGG CTTAA GTGCA CTTTC 
T TTTCC CGAAC TAACC AGACG CCAC AAGGG TTTGC GGGCA GGC§G CTGGG CTTAA GTGCA CTT . C 
T TTTTC GAAAC TATCC AGACG TTGC AAGGG TTTGC GGGGCAGCGA CTGAG CTTAA GTGCA CTTTC 
T TTTTC GAAAC TATCC AGACG TTGC AAGGG TTTGC GGGGCAGCGA CTGAG CTTAA GTGCA CTTTC 
T TTTTC GAAAC TATCC AGACG TTGC AAGGG TTTGC GGGGCAGCGA CTGAG CTTAA GTGCA CTTTC 
T TTTTC GAAAC TATCC AGACG TTGC AAGGG TTTGC GGGGCAGCGA CTGAG CTTAA GTGCA CTTTC 
T TTTAT CGTAC TCCCC AGACAGAGC GCTGG TTTGC GGG . C AGGTA CTGGG CTTAA GTGCG TTTAC 
T TTTAT CGTAC TCCCC AGACAGAGC GCTGG TTTGC GGG . C AGGTA CTGGG CTTAA GTGCG TTTAC 
T TTTAT CGTAC TCCCC AGACAGAGC GCTGG TTTGC GGG . C AGGTA CTGGG CTTAA GTGCG TTTAC 
T TTTAT CGTAC TCCCC AGACAGAGC GCTGG TTTGC GGG . C AGGTA CTGGG CTTAA GTGCG TTTAC 
T TTTAT CGTAC TCCCC AGACAGAGC GCTGG TTTGC GGG . C AGGTA CTGGG CTTAA GTGCG TTTAC 
T TTTAT CGTAC TCCCC AGACAGAGC GCTGG TTTGC GGG . C AGGTA CTGGG CTTAA GTGCG TTTAC 
T TTTAT CGTAT TGGCC AGTCT GCAT TAGGG GGTAC GGA . C AGgSa CTGGG CTTAA GTGCA CTTTC 



] 



3 
- 4 



1 



TGTTCCGTTGCTCCCCAAACCCC 1170 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 252 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCAAACCCC 262 

TGTTCCGTTGCTCCCCCAACCCC 262 

TGTTCCGTTGCTCCCCCAACCCC 262 

TGTTCCGTTGCTCCCCCAACCCC 262 

TGTTCCGTTGCTCCCCCAACCCC 262 

TGTTCCATTGCTCCC CGAAC CCC 261 

TGTTCCATTGCTCCC CGAAC CCC 261 

TGTTCCATTGCTCCC CGAAC CCC 262 

TGTTCCATTGCTCCC CGAAC CCC 262 

TGTTCCATTGCTCCC CGAAC CCC 262 

TGTTCCATTGCTCCC CGAAC CCC 262 

TGTTAAGTTGTTCCCCAAGCCCC 262 4 



I 

□ 



B 



IRL 



IRR 



IRR 



(raTTGGGGAGCA^TGGAACCaAAAACCAACQTMG 6G6GITiGGGGAGCAA?GGAACAG^AA§IGCACTT^ 
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Figure 2. Comparison of tlie nucleotide sequences of TIME elements. A. Multiple alignment of the nucleotide sequences of TIMEs identified 
in bacterial genomes (NCBI database) by in silico comparative analyses. Accession numbers of the sequences are given on the left. A detailed 
description of the sequences (including the host strain, replicon and nucleotide positions) is presented in Table S3. TIME1-TIME4- functional non- 
autonomous elements identified in this study (stars indicate nucleotide substitutions in the TIME2-TIME4 sequences in relation to TIME1). Dots 
indicate gaps introduced to optimize the alignment. Only the sequence of TIME-RES homologous to TIME was used for the comparative analysis. 
Residues conserved in all predicted elements are shaded light violet, residues conserved in TIME1-TIME4 and other elements are shaded yellow, and 
those conserved in other elements (but not in TIME1-TIME4) are shaded green and red. The IRL and IRR sequences are marked. Four classes of TIME, 
distinguished by sequence similarity, are indicated on the right. B. Comparative analysis of the TIME and TIME-RES IR sequences shown as a pictogram 
(font size corresponds to the relative frequencies of the bases at each position within the termini). Numbers at right indicate the position in 
nucleotide sequence of each element. 
doi:l 0.1 371/journal.pone.01 0501 0.g002 



with a hypothetical protein of Pseudomonas slulzeri RCH2 
(accession no. YP_007242478)]. The predicted TnpR of TIME- 
COMPl shares the highest amino acid similarity ({^4%) with a 
putative resolvase of P. slutzeri (accession no. WP_() 19407077). 
Most of the related proteins are not transposon-encoded and they 
are shorter and more divergent than typical resolvases of the Tn3 - 
famUy [e.g. TnpR of TIME-COMPl (187 aa) displays only 45% 
aa identity to TnpR of TIME-RES (309 aa)]. 

In silico sequence analysis of the core region of TIME-COMP2 
revealed the presence of four ORFs. Interestingly, the predicted 
protein encoded by ORFl shows significant similarities to Rep 
proteins involved in the replication initiation of several plasmids: 
pPMA4326C of Pseudomonas syringae pv. maculicola ES4326 
(74% aa identity) [36], pXF5823 of Xylella fastidiosa LAR20 
(72%) [37] andpHLHK19 oi Larihacler hongkongensis HLHK19 
(52%) [38]. The other three ORFs of TIME-COMP2 encode 
putative proteins: (i) ORF2- hypothetical protein with 75% aa 
sequence similarity to a related protein of P. aeruginosa 
BWHPSA040 (accession no. ETV34345), (ii) ORFS- virulence- 
associated protein (VapD) found in many pathogenic bacterial 
strains with 76% aa identity to VapD of Candidalus Glomer- 
ibacter gigasporarum BEG34 (accession no. WP_0066821 13), and 
(iii) partial ORF4— MobA/L family protein involved in plasmid 
mobihzation for conjugal transfer with 80% aa identity to P. 
syringae pv. glycrnea str. race 4 (accession no. EGH16743). 

Since the termini of TIMEs and TIME-derived elements share 
the highest level of sequence identity with the IRs of Tn5563a, it is 
probable that the Tn5563a-encoded transposase and similar 
transposases are responsible for mobilization of these non- 
autonomous elements. Analysis of the distribution of related 
transposase genes in the mine Pseudomonas spp. strains, 
performed by PGR (primers TNPR and TNPF; Table S2), 
identified homologous sequences in strains ZMl and LM8 (in 
which transposition of TIMEs was observed) as well as in ZM2, 
LM5, LM6 and LM14. The amphfied DNA fragments (1555 bp) 
encoded predicted proteins with 97-100% aa sequence identity to 
the transposase of Tn5563a, which proves wide distribution of 
related transposons (complete or truncated) in collected Pseudo- 
monas spp. strains. 

TIMEs in sequence databases 

Database searches by sequence similarity (BLASTN) were 
performed to identify elements related to TIMEs. Searches were 
made for highly similar as well as somewhat similar sequences. 
This analysis identified 18 complete TIMEs as well as several 
truncated forms containing either the 3 ' or 5 ' terminal parts of the 
elements. The vast majority of these sequences originate from 
Pseudomonas spp. strains, which strongly suggests a host 
preference of these elements. However, in two cases, TIMEs were 
identified in strains representing other genera of Gammaproteo- 
bacteria: Proteus mirahilis and X. campestris (Table S3). 



TIMEs are found both in plasmids (11 elements) and bacterial 
chromosomes (7). They usually occur in close proximity to genes 
encoding transposases and rarely within defined Tn3-family 
transposons. Interestingly, only four of the identified elements 
are flanked by DR sequences (Table S3). Since a few of the 
identified TIMEs from both sites are flanked by 3' terminal parts 
of different genes, it is highly probable that homologous 
recombination between two copies of these elements has occurred. 

In contrast to other MITEs, TIMEs are not widely distributed 
within their hosts' genomes. They are usually present as single 
copy elements. Two copies were found only in three cases: within 
plasmid QKH54 of an uncultured bacterium [39] and in the 
chromosomes of two strains of Pseudomonas spp. (accession nos. 
AB083212 and CP005976, respectively) (Table S3). 

Comparative analysis of TIMEs revealed that all contain 
conserved terminal IR sequences, typical for the Tn5-family 
transposons, whose consensus sequence is presented in Fig. 2B. 
The core region of the elements is less well conserved. Four classes 
of TIMEs could be distinguished based on comparative analysis, 
differing in the level of sequence conservation (Fig. 2A). Interest- 
ingly, despite the observed sequence differences, the size of TIMEs 
is highly conserved. Majority are 262 bp long, with exception of 
two elements identified in chromosome of P. putida H8234 
(261 bp) and an element identified in plasmid pDC3000A of P. 
syringae pv. tomato DC3000 (256 bp) (Table S3). 

As shown in Fig. 2A, all the identified TIMEs contain short 
stretches of conserved sequences in their core regions. Since these 
elements (as well as other MITEs) may be potentially co- 
transcribed together with adjac(;nt genes, it is highly probable 
that such sequences may determine the tertiary structure of the 
resulting transcripts. Indeed, the results of Mfold analysis revealed 
that transcripts containing diflerent TIMEs have the potential to 
fold into similar stable secondary structures [40] (Fig. 3), like those 
determined for other MITEs (e.g. [10]). 

Analysis of the distribution of TIME derived transposon-like 
TEs in bacterial genomes revealed the presence of only one such 
element, identical to TIME-RES but without flanking DRs, within 
the chromosome of P. stutzeri DSM 4166 (accession no. 
CP002622; position 4333459^334644). 

Distribution of TIME elements in collected bacterial 
strains 

The distribution and copy number of TIMEs in the genomes of 
bacterial strains isolated from Lubin Mine and Zelazny Most was 
examined by DNA-DNA hybridization. DIG-labeled molecular 
probes specific for (i) TIMEs (PCR-amplified TIME fragment), (ii— 
iii) TIME-RES and TIME-COMPl (fnpR gene fragments) and 
(iv) TIME-COMP2 {rep gene fragment) were hybridized with total 
DNA from 13 strains ot Pseudomonas spp. that had been digested 
with restriction endonucleases. 

This analysis revealed that TIME homologs are present in only 
four of the examined Pseudomonas spp. strains: ZMl (3 TIME 
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copies detected), LM8 (4 copies), LM14 (2 copies) and LM6 (1 
copy) (data not sliown). Since no TIMEs were "captured" in LM6 
or LM14 using trap plasmid pMATI, PGR analysis was used to 
determine whether these strains contain complete copies of the 
elements. A degenerate primer IRINEL, complementary to the IR 
sequences of TIMEs, was used in PGR with total DNAs of the 
aforementioned strains as template (DNA of ZMl and LM8 
served as control templates). In each case, a specific DNA product 
of the expected size was obtained. DNA sequencing confirmed the 
presence of intact TIME copies in strains LM6 and LM14 that 
were identical to TIMEl and TIME2, respectively. 

The hybridization analysis also sli()\\Td that tlu- TIME-dcrived 
transposon-like elements are present at one i:opy per gimome in 
the strains LM8 (TIME-RES) and ZMl (TIME-COMP), in which 
they were previously identified using trap plasmid pMATl. 

Hybridization analysis was then used to determine the location 
(plasmid/chromosome) of TIME elements in the ZMl, LM8, 
LM6 and LM14 genomes. DNA electrophoretic analysis indicated 
that only two strains, ZMl and LM8, contain plasmids, which 
have been designated pZMlPl and pLM8P2, respectively. A 
positive hybridization signal with a TIME-specific molecular 
probe was observed with DNA fragments of both plasmids. The 
hybridization pattern suggested that they contain one (pLM8P2) 
or two (pZMlPl) TIME copies. Interestingly, both TIME- 
GOMPl- and TIME-GOMP2 -specific probes hybridized with 
DNA of plasmid pZMlPl. 

Characterization of natural plasmids containing TIMEs 

The nucleotide sequences of plasmids pZMlPl (7793 bp) and 
pLM8P2 (22 750 bp) were determined and analyzed. The genetic 
organization of these plasmids is presented in Fig. 4. A summary 
of the identified ORFs, including their position, the size of the 
putative proteins they encode and their closest homologs, is 
presented in Table S4. 

As shown in Fig. 4, a common feature of these plasmids is the 
presence of predicted genetic modules involved in (i) replication 
(REP) and (ii) mobilization for conjugal transfer (MOB). However, 
these replicons differ in their overall genetic organization. Plasmid 
pLM8P2 also contains a truncated ttgGHI solvent efilux pump 



operon. Homologous operons, commonly identified in P. putida 
strains, play a major role in the extrusion of toluene and a wide 
range of toxic aromatic hydrocarbons [41,42]. 

The REP modules encode diverse replication initiation proteins, 
conserved in mzny Pseudomonas spp. plasmids. pZMlPl contains 
ORFl encoding a member of the Rep_3 superfamily (PFAM: 
PFOlOol), with highest similarity to a protein encoded by plasmid 
pPMA4326G from phytopathogenic strain P. syringae pv. 
maculicola ES4326 (74% aa sequence identity) (accession no. 
YP_025704) [36]. pLM8P2 encodes a protein most similar to 
Rep A of P. alcaligenes plasmid pRA2 (65% aa sequence identity) 
(acces.sion no. YP_025331) [43]. 

The results of the hybridization analysis indicated that pZMlPl 
contains both TIME-GOMPl and TIME-GOMP2. However, 
sequence analysis of this plasmid identified only a complete copy of 
TIME-GOMPl (Fig. 4A). This element is flanked by 5-bp long 
DRs (5'-AAGGA-3'), confirming that it was incorporated into the 
plasmid genome by means of transposition. It is located within a 
terminal part of ORF6 (disrupted upon insertion) encoding a 
predicted protein of unknown function. Analysis of the %GG 
profile of the pZMlPl nucleotide sequence showed that the 
TIME-GOMPl DNA region has a significantly lower GG content 
(53.3%) than the rest of the plasmid (63.7%), suggesting its 
exogenous origin. 

Interestingly, comparative sequence analysis revealed that the 
DNA region of pZMlPl directiy adjacent to TIME-GOMPl 
(Fig. 4A) is identical to the core region of TIME-GOMP2. 
Therefore, the TIME-GOMP2 copy, "captured" by a trap 
plasmid pMATl, seems to have been generated by two 
transposition events: (i) spontaneous transposition of a TIME into 
another location within the pZMlPl genome followed by (ii) 
TIME-directed mobilization for transposition of part of the 
pZMlPl genome containing the functional REP module 
(Fig. 4A). 

Plasmid pLM8P2 contains only one TIME, which is not 
bordered by DRs. In this case, the TIME is also placed close to the 
REP region, downstream of the gene encoding replication 
initiation protein RepA (Fig. 4B). 
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Transcription activation by TIMEs 

To test whether TIMEs contain a promoter capable of driving 
the transcription of downstream genes, a TIMEl element was 
amplified by PGR and inserted (in both orientations - termed A 
and B) into the broad host range promoter probe vector pCM132 
(functional in Pseudomonas spp.) to generate transcriptional 
fusions with a promoter-less lacZ reporter gene ("A" orientation 
— IRR adjacent to the lacZ gene; "B" orientation — IRT adjacent 
to the lacZ gene). The constructed pCM132 derivatives (pCM132- 
TIME_A and pCM132-TIME_B) were introduced into P. 
aeruginosa PAO 1161 and P-galactosidase activity assays were 
used to examine promoter strength. 

Increased P-galactosidase activity was observed only in 
PA01161 containing pCM132-TIME_A (141 ±14 Miller units, 
compared to 70 ±1 Miller units for the strain containing control 
plasmid pCM132). No increased activity was observed in the case 
of pCM132-TIME_B (data not shown). These results indicate the 
presence of a weak promoter within TIMEl ("A" orientation), but 
detailed inspection of the nucleotide sequence did not allow its 
locahzation. 

Since the nucleotide sequences of the 3 '-end parts of TIMEs 
and TIME-RES (104 bp; contains IRR) are highly conserved 
(Fig. 2), the terminal DNA region of TIME-RES was amplified by 
PGR, cloned into pCM132 and tested as described above. 
However, no promoter activity was detected in this case (data 
not shown). 

Discussion 

In this study we identified and analyzed active transposable 
elements in a pool of strains of the genus Pseudomonas , isolated 
from Lubin copper mine and post-flotation tailings in Zelazny 
Most in Poland. A surprising discovery was the identification of 
several related TEs of diverse structure, which represented novel 



non-autonomous transposons (Fig. 5). Although these elements 
lack a transposase gene, they possess characteristics typical of the 
Tn3 transposon family [44]. The simplest forms of the non- 
autonomous elements were designated TIMEs (Tn3 family- 
derived Inverted-repeat Miniature Elements). AH TIMEs identi- 
fied in this study are of the same size (262 bp) and show a very 
high level of sequence similarity (99-100%). BLAST searches of 
the NCBI databases identified related elements with more 
divergent sequences (70-99% identity), but they all possessed 
identical IRs (Fig. 2). 

Initiation of the process of transposition requires binding of a 
transposase to compatible terminal IRs. The ability of Tn3 -family 
transposases to interact with IRs of other elements of this family 
has been reported previously [45]. The high level sequence 
identity of the IRs of TIMEs and Tn5563a, identified in this study 
(Fig. IB), strongly suggests that Tn5563a-related transposases are 
responsible for the iran,s-mobilization of TIMEs. We did not 
observe transposition of Tii5563a in Pseudomonas spp. strains 
LM8 and ZMI in which TIMEs were identified. However, the 
presence of genes encoding the predicted trans-a.ctmg transposas- 
es, with 99-100% aa identity to Tn3563sL transposase, was 
demonstrated in both strains using PGR (data not shown). Both 
TIMEs and autonomous Tn5565a were found to transpose into 
trap plasmids at comparable frequencies, which proves that trans- 
mobilization of non-autonomous elements is not a rare phenom- 
enon. We detected TIMEs in 4 of the analyzed strains of 
Pseudomonas spp., which strongly suggests that these elements 
might be more widely disseminated than expected from a 
bioinformatic database survey (18 elements identified in available 
bacterial genome sequences; Fig. 2; Table S3). Like autonomous 
transposons of the Tn3 family, most TIMEs are present at one 
copy per genome. This may result from transposition immunity, a 
phenomenon that is thought to apply to members of the Tn3 
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family, which precludes the transposition of more than one copy of 
an element into a single replicon [44]. 

The presence of conserved IRs within TIMEs permits the rapid 
identification of these elements irrespective of the sequence 
diversity of their core regions. Related elements, designated Ky 
and Axis, have previously been identified by in silico analysis of 
nucleotide sequences originating from different strains of Pseudo- 
monas spp., in the course of unrelated projects [46,47]. The Ky 
element was identified in the Tn3-family transposon Tn5041 of 
Pseudomonas sp. KHP41, within a 4-kb region containing relics of 
mobile elements and truncated genes [46] , while the AxIS element 
was found close to the pheAB operon within plasmid pAMlO.6 of 
P. fluorescem Cb36 [47]. As shown in Fig. 2 (AF020724), this 
element has the sequence characteristics of TIMEs. 

Two other TIME-like elements have also been identified 
previously: NSMphe and ISIOI. Although they lack overall 
sequence similarities with TIMEs, their IR sequences are 
conserved in several Tn3 -family transposons. The ARMphe 
element was identified together with AxIS by Peters and colleagues 
[47], who observed the presence of two short (240 bp), identical 
'TS-like elements" flanking the pheAB operon, which resembled 
the structure of a composite transposon. 

The ISlOl element is of special interest because its functional 
characterization was performed during early studies on the Tii3- 
famUy transposons. It was designated as an insertion sequence, but 
it does not encode a transposase. This 209-bp-long element was 
discovered as a natural component of plasmid pSClOl, by its 
ability to generate in vivo co-integrates between the parental 
replicon and filamentous phage fl, in a manner typical for 
replicative transposition [48] . Further studies revealed that ISlOl 



can be trans-mohilized for transposition by TnlOOO transposase 
[49], and therefore it should be considered a MITE. 

Interestingly, it was also demonstrated that ISlOl contains a 
resolution site, which can function - albeit poorly - with Tn3- and 
TniOOO-encoded resolvases [48]. Therefore, this element contains 
the CM-required regions that are crucial for both transposition (IRs) 
and efficient resolution of co-integrates (predicted res site with very 
limited sequence similarity to the Tn3 or TnlOOO res sites [45]). 
We speculate that TIMEs as well as ARMphe elements may also 
contain functional 7'es sites, but the identification of compatible 
resolvases is necessary to prove this. 

All of the aforementioned elements originate from different 
progenitor Tn5-family transposons. Although their nucleotide 
sequences are divergent, they constitute a distinct group of 
elements. Therefore, we suggest the establishment of a TIME 
family, gathering aU elements that can be mobilized for 
transposition by Tn3 -family transposons. Both AKMphe, predict- 
ed in silico, and the functional ISlOl elements, which have arisen 
independently from TIMEs, should be included in this family. W e 
suggest designating these elements TIMEARM/)Ae and TlMEisi oi, 
respectively. 

TIME elements are found within both intergenic regions and 
annotated genes. Mfold analysis revealed that TIMEs are able to 
fold into long stem-loop structures at the RNA level, with free 
energies (AG) ranging from approx. —70 kcal/mol to — 1 15 kcal/ 
mol (Fig. 3), which is typical for many MITEs (e.g. [50]). Since 
their RNAs are predicted to form thermodynamicaUy stable 
secondary structures, the co-transcription of TIMEs with nearby 
genes might affect the expression of these genes by changing their 
mRNA stability, e.g. as in the case of ERIC elements of 
Enterohacteriaceae [51]. This altered expression may result in 
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various phenoUqics, depending on the specific role of a given gene. 
In plasmid pLM8P2, whose structure was revealed in this study 
(Fig. 4B), the TIME is placed downstream of the rep gene 
encoding replication initiation protein. The possible co-transcript 
may have altered stability, which could potentially affect the rate 
of replication initiation and might consequendy influence the copy 
number of the replicon. 

Another interesting attribute of TIMEs demonstrated in this 
study, is their ability to mobilize segments of genomic DNA for 
transposition, which results in the generation of more complex 
non-autonomous elements (named TIME-COMP), resembling IS- 
driven composite transposons in structure. Moreover, we have 
shown that TIMEs may contain functional promoters, which 
could drive the transcription of downstream genes and potentially 
activate "silent phenotypes". Therefore, TIMEs may play the role 
of a natural, transposition-based gene capture and expression 
system. 

Using a trap plasmid, we demonstrated the transposition of such 
a transposon-like element (TIME-COMP 1), originally residing in 
the natural plasmid pZMlPl oi Pseudomonas sp. ZMl. Interest- 
ingly, we also found that spontaneous transposition of a TIME into 
a separate location within this plasmid resulted in the generation of 
another element (TIME-COMP2), whose core region contains the 
pZMlPl replication system. 

It is well known that plasmid genomes can be separated into 
several DNA cassettes encoding specific functions. The observed 
huge diversity of plasmids is the result of recombinational shuffling 
of DNA segments (containing single genes or whole genetic 
modules) between different replicons co-residing in a cell. Here we 
provide the first evidence that such gene shuffling may be driven 
by non-autonomous TEs. Transposition events might therefore 
significandy influence the structure of bacterial plasmids, leading 
to the generation of molecules with novel properties, including 
composite plasmids containing more than one replication system 
(e.g. [52]). 

Another type of functional non-autonomous element described 
in this study is represented by TIME-RES - a TIME derivative 
containing a genetic module homologous to the site-specific 
recombination system of the Tn5 -family transposon Tn5563 [32]. 
Recombination systems of this type play an important role in the 
biology and evolution of both transposable elements and plasmids. 
When present in transposons, they enable efficient resolution of 
co-integrates, i.e. intermediates of replicative transposition, while 
in plasmids they function as stabilization systems, responsible for 
the resolution of plasmid multimers into monomeric forms (MRS, 
multimer resolution system). Therefore TIME-RES (containing a 
predicted InpR gene and res site) most probably serves as a 
portable recombination system, whose insertion may significandy 
modify the properties of different mobile genetic elements. 

TEs of the Tn3 family are ubiquitous in bacterial genomes. It is 
believed that the Tn5 -family transposons originated from progen- 
itor IS elements that acquired an additional genetic module 
responsible for site-specific recombination [53] (e.g. by insertion of 
an TIME-RES-like element). This evolutionarily-advantageous 
event initiated the acquisition of further genes whose products 
could directiy affect the phenotype of their host. 
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